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TESTING WHETHER JUMPS HAVE FINITE 
OR INFINITE ACTIVITY 
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We propose statistical tests to discriminate between the finite and 
infinite activity of jumps in a semimartingale discretely observed at 
high frequency. The two statistics allow for a symmetric treatment 
of the problem: we can either take the null hypothesis to be finite 
activity, or infinite activity. When implemented on high-frequency 
stock returns, both tests point toward the presence of infinite-activity 
jumps in the data. 

1. Introduction. Traditionally, models with jumps in finance have relied 
on Poisson processes, as in Merton (1976), Ball and Torous (1983) and Bates 
(1991). These jump-diffusion models allow for a finite number of jumps in a 
finite time interval, with the idea that the Brownian-driven diffusive part of 
the model captures normal asset price variations while the Poisson-driven 
jump part of the model captures large market moves in response to un- 
expected information. More recently, financial models have been proposed 
that allow for infinitely many jumps in finite time intervals, using a variety 
of specifications, such as the variance gamma model of Madan and Seneta 
(1990) and Madan, Carr and Chang (1998), the hyperbolic model of Eberlein 
and Keller (1995), the CGMY model of Carr et al. (2002) and the finite mo- 
ment log stable process of Carr and Wu (2003a). These models can capture 
both small and frequent jumps, as well as large and infrequent ones. 

In this paper, we develop statistical procedures to discriminate empir- 
ically between the two situations of finite and infinite number of jumps, 
while allowing in both cases for the presence of a continuous component 
in the model. While many theoretical models make use of one or the other 
type of jumps, no statistical test has been proposed so far that can iden- 
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tify the most likely type in a given data set, as existing tests have focused 
on the issue of testing for the presence of jumps but not distinguishing be- 
tween different types of jumps; see A'it-Sahalia (2002), Carr and Wu (2003b), 
Barndorff-Nielsen and Shephard (2004), Huang and Tauchen (2005), Ander- 
sen, Bollerslev and Diebold (2007), Jiang and Oomen (2008), Lee and Myk- 
land (2008), A'it-Sahalia and Jacod (2009b) and Lee and Hannig (2010), 
among others. 

The setup we consider is one where a univariate process X is observed on 
a fixed time interval [0, T], at discretely and regularly spaced times iA n . In 
a typical high-frequency financial application, X will be the log of an asset 
price, the length of observation T ranges from, say, one day to one year, while 
the sampling interval A n is small, typically measured in seconds. Assuming 
that the observed path has jumps, we want to test whether there are a finite 
number of jumps or not on that path, two properties commonly referred to 
as "finite activity" or "infinite activity" for the jump component of X. 

Our aim is to provide asymptotic testing procedures, as the time lag A n 
between successive observations goes to 0, allowing to decide which of these 
two hypotheses is more likely; that is, we want to decide in which of the two 
complementary subsets 

Q^, = {oj : 1 \— > Xt (lo) has finitely many jumps in [0, T]}, 

£l l T = {uj : 1 \— > Xt(u) has infinitely many jumps in [0, T]} 

of the sample space O we are in. 

More specifically, we want to find tests with a prescribed asymptotic sig- 
nificance level, and with asymptotic power going to 1, to test the null hy- 
pothesis that uj is in $7^,, and also the symmetric null hypothesis that the 
observed co is in Q Z T . We will need some assumptions on the process X, basi- 
cally that it is an Ito semimartingale. However, we wish to keep the solution 
as nonparametric as possible, and in particular we do not want to specify 
the structure of the volatility or of the jumps. 

The simple idea behind the two test statistics we propose is common with 
our earlier work on testing whether jumps are present or not, or whether a 
continuous component is present. We compute certain power variations of 
the increments, suitably truncated and/or sampled at different frequencies. 
Related methodologies are being utilized by other authors. For example, 
Todorov and Tauchen (2010) use the test statistics of A'it-Sahalia and Ja- 
cod (2009b), study its logarithm for different values of the power argument 
and contrast the behavior of the plot above two and below two in order to 
identify the presence of a Brownian component. Cont and Mancini (2009) 
use threshold or truncation-based estimators of the continuous component 
of the quadratic variation, originally proposed in Mancini (2001), in order 
to test for the presence of a continuous component in the price process. The 
resulting test is applicable when the jump component of the process has fi- 
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nite variation, and a test for whether the jump component indeed has finite 
variation is also proposed. In terms of the Blumenthaf-Getoor index j3, this 
corresponds to testing whether /3 < 1. 

We aim here to construct test statistics which are simple to compute and 
have the desirable property of being model-free. In particular, no feature of 
the dynamics of the underlying asset price, which can be quite complex with 
potentially jumps of various activity levels, stochastic volatility, jumps in 
volatility, etc., need to be estimated in order to compute either the statistic 
or its distribution under the null hypothesis. In fact, implementing the two 
tests we propose requires nothing more than the computation of various 
truncated power truncations. 

We consider two testing problems, one where the null hypothesis is finite 
jump activity and its "dual" where the null hypothesis is infinite jump ac- 
tivity. Under the null hypothesis of finite-activity jumps, the test statistic 
we propose is similar to the simpler statistic S n of Ait-Sahalia and Jacod 
(2009b) which was employed to test for the presence of jumps, with an ad- 
ditional truncation step. An appropriately selected truncation mechanism 
eliminates finite-activity jumps, so that the probability limit of the statistic 
S n post-truncation is the same in this paper as that of the simpler statistic 
in the earlier work, under a purely continuous model. While the result is 
indeed in that case that "the answer is the same," this is not completely 
obvious a priori and still needs to be established mathematically. And the 
commonality is limited to probability limits: the two statistics have different 
asymptotic distributions. 

Under the reverse scenario, where the null hypothesis is that jumps are 
infinitely active, then the statistic S' n we propose for this purpose is radi- 
cally new and so is its asymptotic behavior. That second statistic has no 
relationship to previous work. 

As we will see below, when implemented on high-frequency stock returns, 
both tests point toward the presence of infinitely active jumps in the data. 
That is, in the test where Qj, is the null hypothesis, we reject the null; in 
the test where Q? T is the null hypothesis, we fail to reject the null. This is in 
line with the empirical results of a companion paper, A'it-Sahalia and Jacod 
(2009a), which contains an extension to Ito semimartingales of the classical 
Blumenthal-Getoor index j3 for Levy processes and estimators for /3; see 
also Belomestny (2010) for different estimators. This parameter /3 takes val- 
ues between and 2 and plays the role of a "degree of jump activity" for 
infinitely active jump processes. Then if the estimator of j3 is found to be 
"high" in its range [0,2], with a confidence interval excluding 0, as it is the 
case in the empirical findings of A'it-Sahalia and Jacod (2009a), it is a strong 
evidence against finite activity. However, finite activity implies /3 = 0, but 
the converse fails, so using estimators of (3 can at the best allow for tests 
when the null is "infinite activity," and even for this it does not allow for 
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determining the asymptotic level of the test. Thus in fact the present paper 
and the other one are complementary, both aiming to have a picture as com- 
plete as possible of a continuous-time process which is discretely observed 
at increasing frequencies. Finally we can also mention that here the assump- 
tions are significantly weaker than in A'it-Sahalia and Jacod (2009a), in the 
sense that the test proposed here is nonparametric, where the estimator of 
j3 proposed there is parametric. 

The paper is organized as follows. Section 2 describes our model and the 
statistical problem. Our testing procedure is described in Section 3, and 
Sections 4 and 5 are devoted to a simulation study of the tests and an 
empirical implementation on high-frequency stock returns. Technical results 
are gathered in the supplemental article [A'it-Sahalia and Jacod (2011)]. 

2. The model. The underlying process X which we observe at discrete 
times is a one-dimensional process which we specify below. Observe that 
taking a one-dimensional process is not a restriction in our context since, 
if it were multidimensional, infinitely many jumps on [0,T] means that at 
least one of its components has infinitely many jumps, so the tests below 
can be performed separately on each of the components. In all the paper the 
terminal time T is fixed. However, it is convenient, and not a restriction, to 
assume that the process X is defined over the whole half-line. 

Our structural assumption is that X is an Ito semimartingale on some 
filtered space (Q, J 7 , (J r t)t>o, P), which means that its characteristics (B, C, v) 
are absolutely continuous with respect to Lebesgue measure. B is the drift, 
C is the quadratic variation of the continuous martingale part, and v is the 
compensator of the jump measure /i of X. In other words, we have 
ft 



B t {<jj) = I b s (uj)ds, 
(2) '\ 

Ct(oj) = I a s (ui) 2 ds, v(uj,dt,dx) = dtF t (u),dx). 



Jo 

Here b and a are optional process, and F = F t (uj, dx) is a transition measure 
from Q x endowed with the predictable a-field into M \ {0}. One may 
then write X as 

X t = X + [ b s ds+ [ a s dW s 



(3) 



o Jo 



xl{\ x \<i}(^ - u)(ds,dx)+ I I xl{\ x i >1} n(ds,dx), 



small jumps large jumps 

where If is a standard Wiener process. It is also possible to write the last 
two terms above as integrals with respect to a Poisson measure and its 
compensator, but we will not need this here. 
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The cutoff level 1 used to distinguish small and large jumps is arbitrary; 
any fixed jump size e > will do. In terms of the definition (3), changing the 
cutoff level amounts to an adjustment to the drift of the process. Ultimately, 
the question we are asking about the finite or infinite degree of activity of 
jumps is a question about the behavior of the compensator v near 0. There 
are always a finite number of big jumps. The question is whether there are 
a finite or infinite number of small jumps. This is controlled by the behavior 
of v near 0. 

2.1. The basic assumptions. The assumptions we make depend on the 
null hypothesis we want to test. We start with a very mild (local) bounded- 
ness assumption. Recall that a process at is pre- locally bounded if \at\ <n 
for t <T n , for a sequence (T n ) of stopping times increasing to +oo. 

Assumption 1. The processes b t , a t and F t (R)l^ Ft ^ <00 y and J (x 2 A 
l)F t {dx) are pre-locally bounded. 

In some cases we will need something more about the drift b and the 
volatility a. 

Assumption 2. The drift process bt is cddldg, and the volatility process 
at is an ltd semimartingale satisfying Assumption 1. 

Under this assumption we can write at as (3), with a Wiener process W 
which may be correlated with W . Another (equivalent) way of writing this 
is 

(4) a t = a + [ b s ds+ [ a s dW s + N t + V Act s 1 { | A(Ts | >1} , 

Jo Jo s < t 

where A is a local martingale which is orthogonal to the Brownian motion W 
and has jumps bounded by 1. Saying that a satisfies Assumption 1 implies 
that the compensator of the process [N,N] t + ^2 s< t l{|Ao- s |>i} has the form 

Jq n s ds, and the processes bt and nt are pre-locally bounded, and the process 
at is cadlag. 

Next, we need conditions on the Levy measures Ft, which are quite 
stronger than what is in Assumption 1 . We state here a relatively restrictive 
assumption. 

Assumption 3. The Levy measure Ft = Ft(oj,dx) is of the form 
Ft(dx) = ^^P^(a^\ n <(+H + ol-»L ( - }< Jdx 

ly ' \ x 1+7* v ' {0<x<zy>} 1 {-z\ '<x<0}> 

(5) 

+ Fl(dx), 

where, for some pre-locally bounded process Lt>l: 
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(i) a[ + \ a t , z\ + ^ and z\ ^ are nonnegative predictable processes sat- 



isfying 



(6) ^<4 +) <l, ^<2 t () <l, Af^a^ + a{ ] <L U 

(ii) 7j, 7^ and St are predictable processes, satisfying for some constant 
<5>0 

0<7t<2-l/Lt, \6 t \<L t , 7t = S t = 5, 

(7) 

j_)lu iflt>0, 
.1, */7t = 0, 

(iii) F[ = F((uj,dx) is a signed measure, whose absolute value \F(\ satis- 
fies, for some increasing continuous function <j) : [0, 1] i— > [0, 1] with (f)(0) = 
and some constant c £ [0, 1): 

7t = or A t = 0, x>0 =► \F[\([-x,x]) < L t (j)(x A 1), 

(8) 

7< >0 and A t > => /(|xP A (ds) < L t . 



It 



Equivalently, one could take 7^ = 1 identically, provided in (6) At < Lt is 
substituted with A t (l{ 7t=0 } + ^l{ 7t >o}) < £f- 

Since F/ is allowed to be a signed measure, (5) does not mean that F t (dx) 

restricted to (0,^ + ^, say, has the density f^ + \x) = a| + ' > 7^(log(l/|j;|)) 5t /|x| 1+7t ; 
it simply means that the "leading part" of Ft on a small interval (0,e] has 

a density behaving as f t (x) as x 4-0, and likewise on the negative side. 

In all models with jumps of which we are aware in financial economics, 
such as those cited in the first paragraph of the Introduction, the Levy 

measure has a density around 0, which behaves like aj ± ^(log(l/|x|))' s */|:r| 1+7t 
as x 4, or x f (in most cases with 74 and St constant) . Thus all these models 
satisfy Assumption 3. For instance, it is satisfied if the discontinuous part 
of X is a stable process of index j3 G (0,2), with jt = P and St = and 

z^ = 1, and af 1 ^ being constants; in this case the residual measure F[ is 
the restriction of the Levy measure to the complement of [—1,1], and (8) 
holds for any cE (0,1). When the discontinuous part of X is a tempered 
stable process the assumption is also satisfied with the same processes as 
above, but now the residual measure F[ is not positive in general, although 
it again satisfies (8) with any cG (0,1). Gamma and two-sided Gamma 

processes also satisfy this assumption, take 74 = and St = and z[ = 1, 

and being constant. 

This assumption also accounts for a stable or tempered stable or Gamma 
process with time- varying intensity, when jt, St and z^ are as above, but 
a[ are genuine processes. It also accounts for a stable with time- varying 
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index process, as well as for X being the sum of a stable or tempered sta- 
ble process with jump activity index f3 plus another process whose jumps 
have activity strictly less than f3. Furthermore, any process of the form 
Yt = Yq + |||iu s dX s satisfies Assumption 3 as soon as X does and wt is lo- 
cally bounded and predictable. As is easily checked (see Section 1 of the 
supplemental article [Ai't-Sahalia and Jacod (2011)] for a formal proof), un- 
der Assumption 3 the set Vl l T of (1) is (almost surely) 

(9) Q T = {A T >0} where A t = [ A s ds. 

Jo 

The previous assumption is designed for the test for which the null is 
"finite activity." For the symmetric test, the assumption we need is stronger: 

Assumption 4. We have Assumption 3 with jt = P [a constant in (0, 2) /, 
and 5t = 0. 

The reason we need a stronger assumption under the null of infinite jump 
activity is that the asymptotic distribution of the test statistic under the 
null is now driven by the behavior of Ft near 0, whereas in the previous 
situation where the null has finite jump activity it is the Brownian motion 
that becomes the driving process for the behavior of the statistic under the 
null. 

Assumptions 3 and 4 have the advantage of being easily interpretable 
and also easy to check for any concrete model. But as a matter of fact it is 
possible to substantially weaken them, and we do this in the next subsection. 
The reader who is satisfied with the degree of generality of Assumption 3 
can skip the next subsection and go directly to the description of the tests 
in Section 3. 

2.2. Some weaker assumptions. For a better understanding of what fol- 
lows, let us first recall the notion of Blumenthaf-Getoor index (in short, BG 
index). There are two distinct notions. First, we have a (random) global BG 
index over the interval [0, t] defined by 



(10) 



r t = inf^p>0:^ ds J (\x\ p A l)F s (dx) < oo 



This is a nondecreasing [0, 2]-valued process. It is not necessarily right- 
continuous, nor left-continuous, but it is always optional. Second, we have 
an instantaneous BG index jt, which is the BG index of the Levy measure 
Ft, defined as the following (random) number: 

(11) 7 t = infU>0: J {\x\ p A l)F t (dx) < oo^j , 

which necessarily belongs to [0,2]. As a process, jt is predictable. The sym- 
metrical tail function Ft(x) = Ft((—x,x) c ) of Ft (defined for x > 0) satisfies 
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for all oj and t: 

p > 74 =4> lim x p F t (x) = 0, 

(12) 

p < 7t =>■ lim sup x^Ff (x) = oo . 

ae-K) 

In the latter case, x p Ft(x) does not necessarily converge to oo when x — >■ 0. 
When x p Ft(x) — > oo as x — >• for all p < 7t, or if = 0, we say that the 
measure is regular: this is the case when, for example, the function Ft is 
regularly varying at 0. 

The connections between these two indices are not completely straightfor- 
ward; they are expressed in the next lemma, where AX S denotes the jump 
of X at time s: 

Lemma 1. Outside a T-null set, we have for all t: 

(13) V t =inf(p>0:^|AX s | p <oo). 

^ s<t ' 

Moreover if A denotes the Lebesgue measure we have, outside a F-null set 
again, 

(14) r t (w) > A-esssup( 7s (w):sG [0,t]), 

and this inequality is an equality as soon as sup sg r 0jt i )a . e ( ,i] x yB+e F s (x) < oo 
for all e > 0. 

Our general assumption involves two functions with the following prop- 
erties [4> is indeed like in (8)]: 

4> : [0, 1] — > [0, oo) is continuous increasing with 0(0) = 0, 

ijj : (0, 1] — > [1, oo) is continuous decreasing with ^(0) := lim^o 
being either 1 [then ip(x) = 1 for all x], or +oo, in which case 
, x e ip{x) = as x — > for all e > 0. 



(15) 



Assumption 5. The global BG index Tt takes its values in [0,2), and 
there are a constant a € (0,1], two functions eft and ift as in (15) and pre- 
locally bounded processes L(e) and L'(p), such that for all e > and p>2 
and F® X-almost all (uj,t) we have 



are (0,1] => x Tt+£ F t (x) < L{e) 



t 

{ L{e) 



(16) x, yG (0,1] => F t (x)-F t {x{\ + y))<< 



if r t >o, 

^( y + x r ^0(x)), 

*/r t = o, 
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0<u<l, r i = / \x\ p F t (dx) <L'(p) t u p i/j(u). 

J{\x\<u} 

This assumption looks complicated, but it is just a mild local bounded- 
ness assumption, which is made even weaker by the fact that we use the 
global BG index Tt instead of the (perhaps more natural) instantaneous 
index 7 t . 

Now we introduce the set which will be the alternative for our first test 
when the null is "finitely many jumps." This set has a somewhat complicated 
description, which goes as follows, and for which we introduce the notation 
(for p> 2): 

(i7) G ^ t= ^u)f \*\ p mx)- 



{\x\<u} 



Then we set 



(18) n% = nlf >0 U ttif- where 

,r>o 



Q T = the set on which Tt > and, for all a' G (0, 1), 

(19) A ({ sG [0,T]:limx a ' TT F s (x) = oo^ > 0, 

Q,ijF~ = the set on which Tt = and, for all p > 2, 

(20) A ({ sG [°' T ] :]in ^G(p,u) s > o}) >0. 

We will see in Section 1 of the supplemental article [Ait-Sahalia and Jacod 
(2011)] that C Q T , but equality may fail. In view of Lemma 1, and if the 
measures F t are regular [see after (13)], the set Q T r>0 is equal to {IV > 0}, 
which is also the set where there are infinitely many jumps due to a positive 
BG index. The interpretation of the set f2^ r_0 is more delicate: observe that 
G(p, u) t < L'{p) t by (14); then saying that u £ Q, T T ~° amounts to saying that 
for "enough" values of t < T the variables G(p,u)t are not small. 

The following is proved there: 



Lemma 2. Assumption 3 implies Assumption 5, and we have {l l T = Q T 



a.s. 



As for Assumption 4, it can be weakened as follows: 

Assumption 6. There are two constants /3 G (0, 2) and ft G [0,/3) and a 
pre-locally bounded process L%, such that the Levy measure Ft is of the form 

(21) F t (dx) = J^{at ] \ 0<x ^ } +4-h { _ 4 -^ x<0} )d X + F{(d X ) 
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[the same as (5) with St = and *yt = PJ with (6) and the (signed) measure 
F[ satisfying 

(22) f{\xfM)\F[\{dx)<L t . 

We associate with this assumption the following increasing process and set: 

(23) A t = [ A s ds, ni^ = {A T >0}, 

Jo 

where the exponent in f2j? stands for "infinite activity for the jumps asso- 
ciated with the part of the Levy measure having index of jump activity /?," 
and we have C £l T , up to a null set. Again, the set £l T \ is not 
necessarily empty. 

Assumption 4 obviously implies Assumption 6, with the same f3 and with 
ft' = c(3, and in this case fltp = Q T . However, Assumption 6, which is exactly 
the assumption under which the estimation of the BG index is performed in 
A'it-Sahalia and Jacod (2009a), does not require the measure F[ to be finite 
when At = 0, so it is weaker than Assumption 4. 

3. The two tests. 

3.1. Defining the hypotheses to be tested. Ideally, we would like to con- 
struct tests in the following two situations: 

Hq:VlL vs. Hi:Q t , 

(24) 

H :n T vs. 

In order to derive the asymptotic distributions of the test statistics which we 
construct below, we need to slightly restrict these testing hypotheses. Besides 
the sets defined in (18) and (23), we also define two other complementary 
sets: 

(25) n T / = S^£a 2 s ds>0^, fl T oW = U o*ds = 

That is, Qj? is the set on which the continuous martingale part of X is not 
degenerate over [0,T], and the exponents in VlYp and £l T oW stand for "the 
Wiener process is present," and "no Wiener process is present," respectively. 

We will provide tests for the following assumptions (below we state the 
assumptions needed for the null to have a test with a given asymptotic level, 
and those needed for the alternative if we want the test to have asymptotic 
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u o/^oW/a i j u / f2y (Assumptions 1 and 3), 

H ■ il T n (Assumptions 1 and 2) vs. ffi : ' 



power equal to 1): 

Accnmntinnc 1 anrl 9 1 vo. • J 

f2S (Assumptions 1 and 5), 
(26) ~ 

H [VSr (Assumptions land 6), ^ Hl nf2^ (Assumptions 1). 
|^ fij, (Assumptions 1 and 4) 

Note that this lets aside the two sets ttj, n and fi^ \ in the null 

hypothesis: in the context of high-frequency data, no semimartingale model 
where n Q™ w is not empty has been used, to our knowledge. Indeed, 
on this set the path of X over the interval [0, T] is a pure drift plus finitely 
many jumps. It also lets aside the set VL l T \ fi^, under Assumptions 1 and 6, 
which is more annoying. Note that the set £l J T contains those u> for which 
X(uj) is continuous on [0,T], although we would not test against infinite 
activity if we did not know beforehand that there were some jumps. 

Next, we specify the notion of testing when the null and alternative hy- 
potheses are families of possible outcomes. Suppose now that we want to 
test the null hypothesis "we are in a subset Oo" of fi, against the alternative 
"we are in a subset Hi," with of course fioH^i =0. We then construct a 
critical (rejection) region C n at stage n, that is, when the time lag between 
observations is A n . This critical region is itself a subset of Q, which should 
depend only on the observed values of the process X at stage n. We are not 
really within the framework of standard statistics, since the two hypotheses 
are themselves random. 

We then take the following as our definition of the asymptotic size: 



(27) a = sup(limsupP(C n , | A) : A G F, A C SI 



Here P(C n | ^4) is the usual conditional probability knowing A, with the 
convention that it vanishes if f(A) = 0. If P(^o) = 0, then a = 0, which is 
a natural convention since in this case we want to reject the assumption 
whatever the outcome oj is. Note that a features a form of uniformity over 
all subsets AgQq. As for the asymptotic power, we define it as 



(28) P = inf (hm inf ¥(C n | A) : A £ T, A C Qi , P(A) > J 

Again, this is a number. 

Clearly, and as in all tests in high-frequency statistics, at any given stage n 
it is impossible to distinguish between finitely many or infinitely many jumps 
(or, for that matter, between a continuous and a discontinuous path). So 
testing such hypotheses can only have an asymptotic meaning, as the mesh 
A n goes to 0. Now, our definition of the asymptotic level is the usual one, 
apart from the fact that we test a given family of outcomes rather than a 
given family of laws. For the asymptotic power, it is far from the typical 
usual statistical understanding. Namely saying that P = 1, as will often be 
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the case below, does not mean anything like the infimum of the power over 
all possible alternatives is 1; it is rather a form of consistency on the set of 
alternatives. 

3.2. Truncated power variations. Before stating the results, we introduce 
some notation, to be used throughout. We introduce the observed increments 
of X as 

( 29 ) A i X = X iA n ~ X (i-l)A n , 

to be distinguished from the (unobservable) jumps of the process, AX S = 
X s — X s ~. In a typical application, X is a log-asset price, so A™X is the 
recorded log-return over A n units of time. 

We take a sequence u n of positive numbers, which will serve as our thresh- 
olds or cutoffs for truncating the increments when necessary, and will go to 
as the sampling frequency increase. There will be restrictions on the rate 
of convergence of this sequence, expressed in the form 

(30) supA£/V<oo 

n 

for some p > 0: this condition becomes weaker when p increases. Two specific 
values for p, in connection with the power p > 2 which is used below, are of 
interest to us: 

(31) pi (p) = __ p2 ( p )= — A 



2p 1 ' XL ' 4p-4 lip -10 

These quantities increase when p increases [on (2, oo)], and p±(p) > pi(p) > 0. 
Finally, with any p > we associate the increasing processes 

[t/A n ] 

(32) B(p,u n ,A n ) t = ]T \A?X\n {lA n Xl < Un} 

i=i 

consisting of the sum of the pth absolute power of the increments of X, 
truncated at level u n , and sampled at time intervals A n . These truncated 
power variations, used in various combinations, will be the key ingredients 
in the test statistics we construct below. 

3.3. The finite- activity null hypothesis. We first set the null hypothesis to 
be finite activity, that is, Slo = n^nfiy , whereas the alternative is Q\ = £l l T . 

We choose an integer k > 2 and a real p > 2. We then propose the test 
statistic, which depends on k, p, and on the truncation level u n , and on the 
time interval [0,T], defined as follows: 

,oox Q _ B(p,u n ,kA n ) T 

B(p,u n ,A n ) T 

That is, we compute the truncated power variations at two different frequen- 
cies in the numerator and denominator, but otherwise use the same power 
p and truncation level u n . Since k is an integer, both truncated power vari- 
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ations can be computed from the same data sample. If the original data 
consist of log-returns sampled every A n units of time, then sampling every 
kA n units of time involves simply retaining every kth observation in that 
same sample. 

The first result gives the limiting behavior of the statistic S n , in terms of 
convergence in probability: 

Theorem 1. (a) Under Assumption 1 and if the sequence u n satisfies 
(30) with some p < 1/2, we have 

(34) S n A k vl2 ~ x on the set £l f T n 9^. 

(b) Under Assumptions 1 and 3 (resp., Assumption 5) and if the sequence 
u n satisfies (30) with some p< p\(p), we have 

(35) S n -"-)• 1 on the set Q, l T (resp., OJ^). 

As the result shows, the statistic S n behaves differently depending upon 
whether the number of jumps is finite or not. Intuitively, if the number of 
jumps is finite, then at some point along the asymptotics the truncation 
eliminates them and the residual behavior of the truncated power variation 
is driven by the continuous part of the semimartingale. More specifically, 

B(p,u n , A n )x is of order O p (A^/ 2 1 A(p)t) where A(p)x = m p \a s \ p ds is 
the continuous variation of order p and m p is a constant defined below. It 
follows from this that the ratio in S n has the limit given in (34) since the 
numerator and denominator tend to zero but at different rates. 

By setting p < 1/2 in the truncation rate, we are effectively retaining 
all the increments of the continuous part of the semimartingale, and so we 
indeed obtain the "full" continuous variation A(p)t after truncation. Note 
that by contrast, the untruncated power variation converges when p > 2 to 
the discontinuous variation of order p, namely B(p)x = ^ S<T |AX S | P , and 
so we would not have been able to distinguish finite or infinite jump activity 
without truncation, as long as jumps (of any activity) are present. 

If the jumps have infinite activity, on the other hand, that is, under the 
alternative hypothesis, the asymptotic behavior of the truncated power vari- 
ation is driven by the small jumps, whether the Brownian motion is present 
or not, and the truncation rate matters. We will see, for example, that, un- 
der Assumption 4 or 6, the truncated power variation is of order O p [v^ ). 
Since here we are truncating at the same level u n for the two sampling fre- 
quencies A n and kA n , this fact has no consequence on the behavior of the 
ratio S n , which tends to 1 as stated by (35) even when Assumption 6 fails. 

Theorem 1 implies that for the test at hand an a priori reasonable critical 
region is C n = {S n < c n }, for c n between 1 and k v l 2 ~ l , and in particular if 
we choose c n = c in the interval (1, k p l 2 ~ l ) the asymptotic level and power 
are, respectively, and 1 if the model satisfies Assumption 5. 
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For a more refined version of this test, with a prescribed level 06 (0, 1), 
we need a central limit theorem associated with the convergence in (34), 

£— (s) 

and a standardized version goes as follows [we use — > to denote the stable 
convergence in law; see, e.g., Jacod and Shiryaev (2003) for this notion]: 

Theorem 2. Assume Assumptions 1 and 2, and that the sequence u n 
satisfies (30) with some p< 1/2. Then 

(36) (S n - k p/2 ~ l ) I^JV n Af (0, 1) in restriction to flf n Q$ , 
where 

B(2p,u n ,A n ) T 



(37) V n = N(p,k) 



(B(p,u n ,A n ) T y 



(38) N(p, k) = — (k p ~ 2 {l + k)m 2p + kP~ 2 (k - l)ml - 2P/ 2 " 1 m fc A 
m 2p v 



and m M = E(\U\ P \U + Vk^lV\ p ) and m p = E(\U\ P ) for U and V two in- 
dependent A/"(0, 1) variables. 

Note that to implement this asymptotic distribution in practice, and hence 
the test, we simply need to compute truncated power variations for various 
powers (specifically p and 2p, in this case) and at two different sampling 
frequencies (A n and kA n ). No other estimation is required. In particular, 
we do not need to estimate any aspect of the dynamics of the X, such as 
its drift or volatility processes or its jump measure. In that sense, the test 
statistic is nonparametric, or model-free. 

We are now ready to exhibit a critical region for testing H$ : n Qjf vs. 
Hi'.nip, or $7^;, or f^ r>0 (depending on the assumptions), using S n with 
a prescribed asymptotic level a E (0,1). Denoting by z a the a-quantile of 
JV(0, 1), that is, F(U > z a ) = a where U is JV(0, 1), we set 

(39) C n = {S n <k p l 2 ~ l -Za^/Vn}. 

More precisely, we can state the level and power of the test under selected 
alternatives as follows: 

Theorem 3. Under Assumptions 1 and 2, and if the sequence u n satis- 
fies (30) with some p < 1/2, the asymptotic level of the critical region defined 
by (39) for testing the null hypothesis "finite jump activity" (i.e., Q^DQj( ) 
equals a. Moreover the asymptotic power of this test is 1 in the following 
cases: 

• under Assumption 3 with 5 > 0, for the alternative Vt l T , 

• under Assumption 5 with ip(0) = 00, for the alternative Q^, 

• under Assumption 5, for the alternative 0,^ 
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Of course this test has the same asymptotic level a for testing Qj, n 

against its complement (& T C\£lYp) c , under Assumptions 1 and 2; but in this 
case the asymptotic power is probably not 1, and may well be 0. 

3.4. The infinite- activity null hypothesis. In the second case, we assume 
Assumption 6 (or Assumption 4) and we set the null hypothesis to be infinite 
activity, that is, fio = fij? (or Q l T ), whereas the alternative is fii = n 
Qjf. Unfortunately, we cannot simply use the same test statistic S n we 
proposed in (33) for this second testing problem. The reason is that, while 
the distribution of S n is model-free under the null hypothesis of the first 
testing problem, it is no longer model-free under the null hypothesis of the 
second testing problem: its distribution when jumps have infinite activity 
depends upon the degree of jump activity, f3. While it is possible to estimate 
f3 consistently [see A'it-Sahalia and Jacod (2009a)], it would be preferable to 
construct a statistic whose implementation does not require a preliminary 
estimate of the degree of jump activity. 

In order to design a test statistic which is model-free under the null of 
infinite activity, we choose three reals 7 > 1 and p' > p > 2 and then define 
a family of test statistics as follows: 

/ 40 ^ s , = B(p\ ju n , A n ) T B(p, u n , A n ) T 

n B( P / ,u n ,A n ) T B( P ^u n ,A n ) T ' 
In other words, unlike the previous statistic S n , we now play with different 
powers p and p', and different levels of truncation u n and 7« n , but otherwise 
sample at the same frequency A n . Once more, the first result states the 
limiting behavior of S' n in terms of convergence in probability: 

Theorem 4. Assume Assumptions 1 and 4 (resp., Assumptions 1 and 6). 

(a) If the sequence u n satisfies (30) with some p< pi{p), we have 

(41) S' n A 7 P '" P on the set (resp., Q^). 

(b) If the sequence u n satisfies (30) with some p< 1/2, we have 

(42) S' n -A 1 on the set o£ fl Ct^. 

That is, as was the case in Theorem 1, the test statistic S' n tends to 1 under 
the alternative and to a value different from 1 under the null. Intuitively, 
under the alternative of finite jump activity, the behavior of each one of the 
four truncated power variations in (40) is driven by the continuous part of the 
semimartingale. The truncation level is such that essentially all the Brownian 
increments are kept. Then the truncated power variations all tend to zero 

at rate O p (A^ 2 l ) and O p (A^ 2 1 ), respectively, and by construction the 
(random) constants of proportionality A(p)t and A(p')x cancel out in the 
ratios, producing the limit 1 given in (42). 
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If, on the other hand, jumps have infinite activity, then the small jumps 
are the ones that matter and the truncation level becomes material, pro- 
ducing four terms that all tend to zero but at the different orders O p (un ), 
O p (un~^), O p (('ju n ) p ~P) and O p (('yu n ) p '~ 13 ), respectively, resulting in the 
limit (41). By design, the limit is independent of (3. 

A reasonable critical region is C n = {S' n < c n } with c n between 1 and 
7 P ~ p , and if c n = c is in the interval (l,j p ~ p ) the asymptotic level and 
power are, respectively, and 1. For a test with a prescribed level we again 
need a standardized central limit theorem associated with the convergence 
in (41): 

Theorem 5. Assume Assumptions 1 and 4 with c < 1/2 (resp., As- 
sumption 6 with (3' < 13/2), and if the sequence u n satisfies (30) with some 
P < P2(p), we have 

(43) ( ( S;-7 P '^)/yT^ £ ^l ) AA(0,l) in restriction to (resp., Stiff), 
where 

B(2p,u n ,A n ) T B{2p,^u n ,A n ) T 



B(p,u n ,A n ) T ) 2 (B(p,-/u n ,A n ) T ) 2 

B(2p',u n ,A n ) T , B(2p' \jun, A n ) T 

+ (1 — 27 



(44) 



-2 



(B(p',u n ,A n ) T ) 2 (B(p', 1Un ,A n ) T 

B(p + p',u n ,A n ) T 
B(p, Un, A n ) T B(p', Un, A n ) T 

2(l- 7 -P_ 7 -*A B(p + p',~fu n ,A n ) T 



B(p, 7u n , A n ) T B(p', 7u n , A n ) T 



As was the case for S n in Theorem 2, the asymptotic distribution of S' n 
under the null is again model-free. While the expression (44) looks compli- 
cated, implementing it simply requires the computation of truncated power 
variations B of order p, p' , p + p', 2p and 2p' , and at truncation levels u n 
and 7ii n . No other aspects of the dynamics of the X process, such as its 
degree of jump activity f3, for instance, need to be estimated. 

The critical region for testing i^o : or vs - ^1 ■ usm g S' n with a 
prescribed asymptotic level a £ (0, 1) will be 

(45) C' n = {S' n < 7 p '~ p -ZaVK}- 

We can state more precisely the level and power of the test as follows: 

Theorem 6. Under Assumptions 1 and 4 with c < 1/2 (resp., Assump- 
tion 6 with f3' < (3/2), and if the sequence u n satisfies (30) with some p < 
P2(p)> the asymptotic level of the critical region defined by (45) for testing 
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the null hypothesis "infinite activity for jumps" (i.e., Q l T , resp., Vtj, , against 
Qj, fl Qjf ) equals a. Moreover the asymptotic power of this test is 1. 

3 /2 

Under the null hypothesis the rate of convergence is l/i% (contrary to 
the situation of Theorem 3, where the rate was 1 jyj A n whatever j3 and u n 
were). So, although asymptotically u n does not explicitly show in the test 
itself, one should probably take u n as small as possible (we have no choice 
as to P, of course). That is, one should take p as large as possible, which in 
turn results in choosing p as large as possible [recall (31)]. We discuss actual 
choices in practice in Sections 4 and 5 below. 

3.5. Micro structure noise. In practice, the tests presented here need a lot 
of data to be effective, that is, we need a high sampling frequency. However, 
in this case, it is well established that the so-called micro structure noise 
may be a relevant consideration, and in some cases may even dominate at 
ultra high frequencies. It is outside the scope of this paper to provide a 
complete analysis of what happens when noise is present, or to contemplate 
constructing effective testing procedures in the presence of noise. However, 
as a first step, it may be enlightening to determine at least the limiting 
behavior (in probability) of our test statistics in the presence of noise since 
this may help guide the interpretation of the empirical results when the test 
is implemented in practice. 

We start first with pure additive noise, which is the type of noise consid- 
ered by much of the literature, primarily for reasons of tractability, although 
it may not account very well for the microstructure noise encountered in 
practice. It gives, however, an insight on what can happen in the presence of 
noise. In this situation, at any given observation time t we actually observe 
log-returns perturbed by a noise term Et in 

(46) Z t = X t + e t . 

To avoid intricate statements, we make a few basic, and mild, assumptions 
on the noise process et'. those variables are i.i.d., centered, independent of 
the underlying process, with the following property: 

if s 7^ t the variable Et — £ s has a density /, continuous and 
positive at 0. 

Note that no moment condition is required, except E(et) = 0, because we 
consider only truncated increments. 

We write B(p,u n ,A n ) for the variables introduced in (32) if we replace 
XiA n by ZiA n , and likewise for S n and S n if we do the same substitution in 
(33) and (40) (those are associated with some integer k and some numbers 
p,p' > 2 and 7 > 1). Under the above assumption on the noise, we have the 
following: 
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Theorem 7. Let the sequence u n satisfy (30) with some p< 3- Then, 
under Assumption 1 we have 

(48) K^1 P '- P - 

It is remarkable that the assumptions in this theorem are much weaker 
than in the previous results, as far as X is concerned. This is of course 
because the noise, when present, becomes the prevalent factor. This has the 
important consequence that the statistics S n and S n are no longer able to 
distinguish between the two hypotheses of finite or infinite activity when 
noise dominates. When we use S n we get a limit which differs from both 
limits in Theorem 1, so when we test for the null hypothesis of finite activity 
and the empirical value of S n is close to 1/k we can in principle tell that this 
is due to the noise, whereas if it is close to kP^ 2 " 1 or to 1 this is probably 
due to finite activity, or infinite activity. When we use S n the situation is 
worse, because noise plus finite activity leads to accept the hypothesis that 
we have infinite activity whether the jumps have finite or infinite activity. 

Alternatively, it might be closer to the reality to model the microstructure 
noise as a pure rounding noise. There, instead of observing Xi& n , we observe 
PQaJch that is, XjA„ rounded to the nearest multiple of a, where a is the 
tick size: typically a = 1 cent for a decimalized stock, or a = l/32nd of a 
dollar, for bond prices. Let us denote by B(p,u n , A n ) t the variables of (32) 
when we replace AjA n by [A^aJ«- Theorems 1 and 4 were based on limit 
theorems for the truncated power variations B(p,u n , A n ) t . In the rounded 
case we indeed have something radically different: as soon as u n < a, we 
eliminate all increments, because increments are multiples of a. Therefore 
B(p,u n ,A n )t = for all n large enough, and of course the ratios (33) and 
(40) make no sense. It follows that in the case of a rounding noise, the 
statistics proposed in this paper are totally meaningless. And as a matter 
of fact, even when the whole (rounded) path of X is observed, we cannot 
decide whether we have finitely many or infinitely many jumps. This means 
that not only our statistics do not work for this problem, but there cannot 
exist asymptotically consistent tests for this problem. 

Naturally, these two idealized descriptions of microstructure noise do not 
exhaust the possibilities for modeling the noise. One can, for example, use 
a mixed model which mixes additive noise and rounding, or more general 
forms, as in Jacod et al. (2009), for example. At present, however, it is not 
clear how our statistics theoretically behave in these more general cases, nor 
how to construct asymptotically consistent tests, nor even if such tests exist 
at all. 

4. Simulation results. We now report simulation results documenting 
the finite sample performance of the test statistics S n and S' n in finite sam- 
ples under their respective null and alternative hypotheses. We calibrate the 
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values to be realistic for a liquid stock trading on the NYSE. We use observa- 
tion lengths ranging from T = 1 day, consisting of 6.5 hours of trading, that 
is, n = 23,400 seconds, to T = 1 year. The sampling frequencies we consider 
range from A n = 1 second to 5 minutes. The highest sampling frequencies 
serve to validate the asymptotic theory contained above, while the lower fre- 
quencies serve as proxies for situations where sparse sampling is employed 
as a means to reduce the adverse impact of market microstructure noise. 

The tables and graphs that follow report the results of 5,000 simula- 
tions. In order to validate the asymptotic theory developed above, we start 
with the highest sampling frequency of A n = 1 second. Below, we will ex- 
amine the accuracy of the tests as a function of the sampling frequen- 
cies ranging from 5 seconds to 5 minutes. The data generating process is 

1/2 

the stochastic volatility model dX t = atdWt + 6dY t , with at =v t , dvt = 
X (v ~ v t ) dt + iv] 12 dB t + dJ t , E(W t B t ) = pt, ?? V2 = .25, f = 0.5, x = 5, 
~p= — 0.5, J is a compound Poisson jump process with jumps that are uni- 
formly distributed on [—30%, 30%] and Xq = 1. The jump process Y is either 
a /3-stable process with /3 = 1, that is, a Cauchy process (which has infinite 
activity, and will be our model under ) or a compound Poisson process 
(which has finite activity, and will be our model under Qj,). In the infinite- 
activity case, empirical estimates of f3 reported in A'it-Sahalia and Jacod 
(2009a) are higher than 1; simulation results using /3-stable processes with 
such values are qualitatively similar. In the finite-activity case, the jump 
size of the compound Poisson process is drawn from a truncated Normal 
distribution with mean and standard deviation 0.10, designed to produce 
jumps greater in magnitude than 0.05. The estimator is implemented with 
a truncation rate u n = aA" where w = 0.20. In the results, we report the 
level of truncation u n indexed by the number a of standard deviations of 
the continuous martingale part of the process, defined in multiples of the 
long-term volatility parameter j/ 1 / 2 . That is, a is such that u n = (xq 1 / 2 A^/ 2 . 

The statistic S n is implemented with k = 2 and p = 4. When p = 4, the 
constants appearing in (38) are 

1 fi 

(49) N{A,k) = —k(2k 2 -k-l). 

For the second test, S' n is implemented with p' = 4, p = 3 and the second 
truncation level jUn related to the first according to a' = 2a, that is, 7 = 2. 

Given 77, the scale parameter 9 [or equivalently At in (6), which here is a 
constant] of the stable process in simulations is calibrated to deliver different 

various values of the tail probability TP = P(| AYj | > 47/ 1 / 2 A}/ 2 ) reported in 
the tables as low (TP = 0.01), medium (TP = 0.05) and high (TP = 0.10). 
For the Poisson process, it is the value of the arrival rate parameter A that 
is set to generate the desired level of jump tail probability as low (A = 
2/23,400), medium (A = 10/23,400) or high (A = 50/23,400). In the various 
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Table 1 

Monte Carlo rejection rate for the test of finite activity using the statistic S, 
(testing Ho : 0£ (~1 £1? vs - Hi : Q, 1 ^ using S n ) 



Finite Test theo- Test empirical level for a degree of truncation a 



.1 t' — 

tensu v 


level 


6 


7 


8 


9 


10 


12 


15 


Low 


10% 


9.6% 


9.6% 


9.5% 


9.5% 


9.5% 


9.5% 


9.5% 




5% 


5.6% 


4.5% 


4.4% 


4.4% 


4.4% 


4.4% 


4.4% 


Medium 


10% 


11.5% 


10.7% 


10.5% 


10.5% 


10.5% 


10.4% 


10.4% 




5% 


6.1% 


5.0% 


5.0% 


5.0% 


5.0% 


5.0% 


5.0% 


High 


10% 


11.8% 


10.4% 


10.3% 


10.3% 


10.3% 


10.2% 


10.2% 




5% 


5.9% 


5.1% 


5.0% 


5.0% 


5.0% 


5.0% 


5.0% 



simulations' design, we hold rj fixed. Therefore the tail probability parameter 
controls the relative scale of the jump component of the semimartingale 
relative to its continuous counterpart. 

Table 1 reports the Monte Carlo rejection rates of the test of Hq : fij, n 
vs. Hi : at the 10% and 5% level, using the test statistic S n , for various 
levels of truncation a. Recall that for concreteness a is expressed as a number 
of standard deviations of the Brownian part of X. We find that the test 
behaves well, with empirical test levels close to their theoretical counterparts, 
and so for a wide range of values of a. 

The limit in probability of S n is k p / 2 ~ l = 2 with our choice of k and p, pro- 
vided u n / \/ A n — > oo. With our notation a denoting the threshold expressed 
as the number of (normalized) standard deviations of the Brownian part, 
this means that this limit holds when a = a n goes to infinity In practice we 
choose a between 6 and 15, and this introduces a bias. To evaluate this bias, 
in Figure 1, we plot as a function of a the limiting value of S n under Hq, 

Behavior of S Under the Null Hypothesis 
H : Finite Jump Activity 



Brownian + Compound Poisson 
Theoretical Behavior of S 



3rownian + Compound Poisson 
Monte-Carlo Behavior of S 



Fig. 1. Behavior of the test statistic 8 n under Ho : £l T l~l £ljf for varying degrees of trun- 
cation a. 
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Behavior of S Under the Alternative Hypothesis 
Hi : Infinite Jump Activity 



Brownian Alone 
Theoretical Behavior of S 

s 



Brownian + Cauchy 
Theoretical Behavior of S 



Cauchy Alone 
Theoretical Behavior of S 

s 



10 



15 



20 



Brownian + Cauchy 
Monte-Carlo Behavior of S 

s 



: « i i ■ ; ; i i • i • 



Fig. 2. Behavior of the test statistic S n under Hi for varying degrees of trunca- 
tion a. 



when a stays fixed instead of increasing to infinity, and compare it with the 
theoretical limit 2 (in the simple model used for the simulations it is possible 
to numerically compute it with any desired accuracy); this is the left graph, 
and on the right graph we draw the corresponding average value of S n from 
the Monte Carlo simulations, both as functions of a. It is worth noting that 
the behavior of S n is driven by that of the Brownian motion component, 
since the truncation effectively eliminates the finite-activity jumps. For very 
small values of a, a < 5, S n starts below 2 as predicted by the theoretical 
behavior of the continuous martingale part of X. Once a gets above 5, the 
theoretical limit of S n will then remain at 2 as long as a is not so large as 
to start including some jumps. 

Figure 2 plots the limiting value of S n (as a function of a again) un- 
der H\, where the model consists of a Brownian and a Cauchy components 
as described above. The upper left graph is the theoretical behavior of S n 
for the Brownian component taken in isolation, the upper right graph the 
theoretical behavior for the Cauchy component taken in isolation, the lower 
left graph the theoretical behavior for the sum of the two components, that 
is, the model we simulate from, and the lower right graph the correspond- 
ing results from the Monte Carlo simulations. When the Cauchy process is 
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Distribution of S Under the Null and Alternative Hypotheses 



density 



12 - 



8 




6 - 



2 - 



S 



0.8 



1.2 



1 . 4 



1 . 6 



1 . 8 



2 . 



2.2 



2.4 



Fig. 3. Monte Carlo distributions of the unstandardized test statistic S„ under 
Ho : f2y fl Qjf (right histogram) and Hi : fi^? (left histogram). 

present, the three curves in the figure correspond to the low (solid curve), 
medium (long dashes) and high (short dashes) scale of the jump component 
relative to the continuous component. For very small values of a, such as 
a < 5, the behavior of S n tracks that of the Brownian component, which 
is increasing in a toward 2. As a increases, the curves then start revers- 
ing course and tend to 1 instead, the limit driven by the infinite-activity 
jump process as shown in (35). The higher the scale of the jump compo- 
nent relative to the continuous component, the more the curve in the lower 
left graph approximates the corresponding one in the upper right graph, 
that is, the more it resembles that of a pure Cauchy process. For the Monte 
Carlo results, the low, medium and high jump scales are represented by cir- 
cles, squares and diamonds, respectively. In all three cases, they track the 
predicted theoretical limits closely for each value of the truncation level a. 

Finally, we report in Figure 3 histograms of the values of the unstan- 
dardized S n computed under Hq, centered in 2 as expected from (34), and 
Hi, centered in 1 as expected from (35), respectively. Figure 4 reports the 
Monte Carlo distribution of the statistic S n , standardized according to (36), 
compared to the limiting A/"(0, 1) distribution. 

Next, we turn to the symmetric problem, that of testing of Hq : £l l J^ vs. 
Hx-.Q^CWfig , which we do using the statistic S' n . The results are reported in 
Table 2 for the test rejection rate under the null hypothesis. Figure 5 shows 
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Standardized Distribution of S Under the Null Hypothesis 
Finite Activity Jumps 

density 



0.4- 



0.3- 



0.2- 



0.1- 




-4 -2-1012 4 



Fig. 4. Monte Carlo and asymptotic distributions of the standardized test statistic S„ 
under Ho : £1^ n . 

the limiting value of S' n under Hq, as expected from the theoretical limit 
of 7 P ~ p = 2 given in (41) (lower left graph for the Brownian plus Cauchy 
model) and the corresponding average value of S' n from the Monte Carlo 
simulations (lower right graph). The upper graphs in the figure correspond 
to the Brownian alone (left, with a limit of 1) and Cauchy alone (right, with 
a limit of 2) situations. As was the case for the previous test, higher values of 
the jump scale parameter make the resulting semimartingale model approxi- 



Table 2 

Monte Carlo rejection rate for the test of infinite jump activity 
using the statistic S' n (testing Ho'.Qlf vs. Hi:Q^, using S' n ) 



Infinite 


Test 


Test 


jump activity 


theoretical level 


empirical level 


Low 


10% 


8.2% 




5% 


3.6% 


Medium 


10% 


8.9% 




5% 


3.8% 


High 


10% 


10.0% 




5% 


4.9% 
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Behavior of S' Under the Null Hypothesis 
H : Infinite Jump Activity 



Brownian Alone 
Theoretical Behavior of S' 
S' 



Cauchy Alone 
Theoretical Behavior of S' 

S' 





Brownian + Cauchy 
Theoretical Behavior of S' 



Brownian + Cauchy 
Monte-Carlo Behavior of S' 



,!:::: s 5 ! I I 



Fig. 5. Behavior of the test statistic S' n under Hq:Q,^ for varying degrees of trunca- 
tion a. 



mate the behavior of its infinite-activity jump component more closely, while 
for lower scale parameters and low values of a, the behavior is determined 
by the continuous component. In all cases, we find that the Monte Carlo 
results track the predicted theoretical limits for each value of the truncation 
level q. Compared to Figure 2, we find that the limit is approached with less 
precision than was the case of S n , and requires larger values of a than was 
the case for the other test when the infinite-activity jump process is mixed 
with the continuous component. 

Figure 6 shows the point limit results for S' n under H\. There, the theo- 
retical limit of 1 from (42) is reached quickly, as was the case in Figure 1. 
This is the case provided that the values of a are not so large that jumps 
start being retained in the calculations. In particular, the value of S' n can 
become large if a is such that a jump is just included at the u n cutoff but 
not yet at the ^u n cutoff. But in most cases the statistic is very close to 1 
when the number of jumps is finite, simply because once n is large enough 
for the truncation level to have eliminated all the jumps at the higher level 
of truncation "yu n , then there are very few Brownian increments between 
truncation levels u n and 7« n ; therefore B(p,"fu n , A n )x is only marginally 
larger than B{p,u n ,A n )j<, and similarly for power p' . 
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Behavior of S' Under the Alternative Hypothesis 
Hi : Finite Jump Activity 



Brownian + Compound Poisson 
Theoretical Behavior of S 1 

S' 



Brownian + Compound Poisson 
Monte-Carlo Behavior of S 1 

s 



Fig. 6. Behavior of the test statistic S' n under Hi : Q. T l~l for varying degrees of trun- 
cation a. 



Figure 7 reports the Monte Carlo distributions of S' n under Hq and Hi; 
they are centered at 2 and 1 as expected. Under H±, we note that S' n displays 
very little variability with the provision above regarding the value of a. As 
a general rule, the test based on S' n appears more sensitive to values of a 
than the test based on S n . Figure 8 reports the Monte Carlo and asymptotic 
distribution of the standardized S' n under Hq, according to (43). 



Distribution of S' Under the Null and Alternative Hypotheses 

density 



1.2 1.4 1.6 



2.2 2.4 



Fig. 7. Monte Carlo distributions of the unstandardized test statistic S' n under Ho- 
(right histogram) and Hi (left histogram). 
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Standardized Distribution of S 1 Under the Null Hypothesis 
Infinite Activity Jumps 

density 



0.2 - 



0.1 - 




-4 -2-1012 4 

Fig. 8. Monte Carlo and asymptotic distributions of the standardized test statistic S' n 
under Ho : Q, 1 ^ . 

Finally, we examine the accuracy of the tests as a function of the sampling 
frequencies. We consider for this purpose two experiments. In the first one, 
we fix the observation length to T = 5 days (one week) and consider sam- 
pling frequencies from 5 seconds to 5 minutes. The sample size n therefore 
decreases by a factor 60 over the range of values of A n considered. In the 
second experiment, we consider the same sampling frequencies but increase 
the length of the observation window T from 5 days to 1 year, in order to 
keep the sample size approximately constant over the range of values of A n . 
The results of the first experiment are reported in Figures 9 and 10, respec- 
tively. We find little size distortion or power loss. The results of the second 
experiment are, not surprisingly, better and are not shown here in order to 
save space. 

5. Empirical results. In this section, we apply our two test statistics 
to real data, consisting of all transactions recorded during the year 2006 on 
two of the most actively traded stocks, Intel (INTC) and Microsoft (MSFT). 
The data source is the TAQ database. Using the correction variables in the 
data set, we retain only transactions that are labeled "good trades" by the 
exchanges: regular trades that were not corrected, changed, or signified as 
canceled or in error; and original trades which were later corrected, in which 
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Test Statistic S 
Significance Level Under H a and Power Against Hi 
for Different Sampling Frequencies 
H : Finite Activity, H x : Infinite Activity 



Power Against 



Rejection Rate at 10% and 5% Levels 



5s 30s lmn 2mn 5mn 

Fig. 9. Level and power of the test based on the statistic S as a function of the sampling 
frequency. 



case the trade record contains the corrected data for the trade. Beyond that, 
no further adjustment to the data is made. 

We first consider the test where the null hypothesis consists of finite jump 
activity. Figure 11 shows the values of the test statistic S n , along with a 95% 
confidence interval computed from the asymptotic distribution (36) under 
the finite-activity null hypothesis, plotted for a range of values of the trunca- 
tion index a. The truncation level plays the same role as that of a bandwidth 
parameter in classical nonparametric estimator, and it is therefore impor- 
tant that it be properly scaled initially. For this purpose, the values of a are 
indexed in terms of standard deviations of the continuous martingale part 
of the log-price: we first estimate the volatility of the continuous part of 

1 12 

X using the sum of squared increments that are smaller than A n (mean- 
ing that we would retain the increments of up to four standard deviations 
if the annualized volatility of the stock were 25% per year) and then use 
that estimate to form the initial cutoff level used in the construction of the 
test statistic. To account for potential time series variation in the volatility 
process at, that procedure is implemented separately for each day and we 
compute the sum, for that day, of the increments that are smaller than the 
cutoff, to the appropriate power p required by the test statistic. For the full 
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Test Statistic S 1 
Significance Level Under H and Power Against Hi 
for Different Sampling Frequencies 
H : Infinite Activity, H 1 : Finite Activity 



Power Against 



0.5- 



Rejection Rate at 10% and 5% Levels 



5s 30s lmn 2mn 5mn 

Fig. 10. Level and power of the test based on the statistic S' as a function of the sampling 
frequency. 



year, we then add the power variations computed for each day. We then 
compute the results corresponding to a range of values of u n indexed by a. 

In order to account for the presence of market microstructure noise in the 
data, we compute the limiting values of our test statistics in the case where 
the noise is a pure additive noise; the limit is then 1/k, independent of p. We 
also consider the dependence of the test statistics as the sampling interval 
increases, and the signal-to-noise ratio presumably improves. 

The test statistic is implemented with p = 4 and k = 2, using A n = 5 
seconds in the upper panels, and A n = 10 seconds in the lower panels. As a 
result, from Theorem 1, S n should go to k p l 2 ~ l = 2 under the null of finite 
activity, and to 1 under the alternative of infinite activity. As the plots show, 
we find that S n is close to 1, which leads us to reject the null hypothesis of 
finite activity. 

Next, we turn to the test where the null hypothesis consists of infinite 
jump activity. Figure 12 shows the values of the test statistic S' n , along with 
a 95% confidence interval computed from the asymptotic distribution (43) 
under the infinite-activity null hypothesis, plotted for a range of values of 
the truncation index a. The two curves correspond to values of 7 = a' /a 
equal to 2.0 and 2.5, respectively. The test statistic is implemented with 
p' = 4 and p = 3, using A n = 5 seconds in the upper panels, and A n = 10 
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Fig. 11 
MSFT, 



Test Statistic S 
All Transactions 2006 



INTC, A = 5 seconds 



8 9 10 

Truncation Level a 

INTC, A = 10 seconds 



Truncation Level a 
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5 seconds 



8 9 10 

Truncation Level a 

MSFT, A = 10 seconds 



Truncation Level a 



Values of the test statistic S n computed for all 2006 transactions of INTC and 
sampled at 5 and 10 second intervals. 



seconds in the lower panels. Recall from Theorem 4 that under the null of 
infinite activity, S' n should go to 7 P ~ p = 2.0 and 2.5, respectively, for the 
two curves, and to 1 under the alternative of finite activity. We find that S' n 
is close to the predicted value 7 P ~ p , which leads us to not reject the null 
hypothesis of infinite activity. 

To summarize, the answer from both tests appears indicative of infinite 
jump activity in those data: using S n , we reject the null of finite activity, 
while using S' n we do not reject the null of infinite activity. Finally, we 
illustrate in Figure 13 the convergence of S n to 1 as the sampling interval 
decreases, indicating that the null hypothesis of finite activity is rejected 
when high-frequency data (of the order of seconds) are used. On the other 
hand, we see that using longer sampling intervals (of the order of minutes) 
makes it impossible to reject the null hypothesis of finite activity using S n . 
This is compatible with the fact that small jumps occurring over short time 
intervals can be aggregated or smoothed out over longer time intervals. For 
the second testing situation, Figure 14 shows the convergence of S' n to 2 
as the sampling interval decreases from 30 minutes to 5 seconds, indicating 
that the null hypothesis of infinite activity is not rejected at high frequency. 
As in the first test, lower frequency data tend to be more compatible with 
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Test Statistic S' for Truncation Ratios a'/o = 2.0 and 2.5 
All Transactions 2006 
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Fig. 12. Values of the test statistic S' n computed for all 2006 transactions of INTC and 
MSFT, sampled at 5 and 10 second intervals. 



Convergence of Test Statistic S 
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Fig. 13. Box plot for the test statistic S n computed for all 2006 transactions of INTC 
and MSFT, sampled at time intervals ranging from 5 seconds to 30 minutes. 
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Convergence of Test Statistic S 1 for Truncation Ratio a' /a = 2.0 
All Transactions 2006 
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Fig. 14. Box plot for the test statistic S' n computed for all 2006 transactions of INTC 
and MSFT, sampled at time intervals ranging from 5 seconds to 30 minutes. 



finite jump activity. In both cases, longer sampling intervals A n over the 
same sampling length T lead to a reduction in the sample size n = T/A n , 
which generally leads to an increase in the variance of the test statistic, 
making it more difficult ceteris paribus to reject the null hypothesis. 
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SUPPLEMENTARY MATERIAL 

Supplement to "Testing whether jumps have finite or infinite activity" 

(DOI: 10.1214/11-AOS873SUPP; .pdf). This supplementary article contains 
a few additional technical details about the assumptions made in this paper, 
and the proofs of all results. 
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